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We adopt a physically motivated empirical approach to the characterisation of the distributions 
of twin and triplet primes within the set of primes, rather than in the set of all natural numbers. 
Remarkably, the occurrences of twins or triplets in any finite sequence of primes are like fixed- 
probability random events. The respective probabilities are not constant, but instead depend on 
the length of the sequence in ways that we have been able to parameterise. For twins the "decay 
constant" decreases as the reciprocal of the logarithm of the length of the sequence, whereas for 
triplets the falloff is faster: decreasing as the square of the reciprocal of the logarithm of the 
number of primes. The manner of the decrease is consistent with the Hardy-Littlewood Conjectures, 
developed using purely number theoretic tools of analysis. 
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INTRODUCTION 

Recently we discovered a novel approach to the char- 
acterisation of the distribution of twin primes and re- 
alised some of its consequences [Q, |[ ||. Our results 
have been confirmed and shown to be consistent over a 
broader range by Wolf Q|. Twins are pairs of primes 
{p,p + 2} whose arithmetic separation is minimised, i.e., 
they consist of consecutive odd natural numbers. The 
Twin Prime Conjecture posits that there are an infinite 
number of twins Q. Long ago, Hardy and Littlewood 
applied sieve arguments to establish relations which de- 
scribe the behaviour of ^(iV), the number of twins with 
constituents less than natural number N. They obtained: 
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where the so-called twin prime constant, C 2 — 0.66016 . . . 
is calculable to much greater accuracy than is quoted 
here 

Prime triplets comprise the next level of allowed struc- 
ture in the sequence of prime numbers. Again the arith- 
metic difference between the first constituent and the last 
is minimised. This minimum value is not 4 as one might 
naively suppose because any set of odd numbers of the 
form {n, n + 2, n + 4} has an element which is equivalent 
to zero(modulo 3). Thus the minimum arithmetic differ- 
ence is 6, and hence there are two "flavours" of triplets: 
those of the form {p,p + 2,p + 6}, which we call the twin- 
outlier (to) type and those of the form {p,p + 4,p + 6} 
which we shall denote outlier- twin (ot). 

Hardy and Littlewood applied their analysis to both 
flavours of triplets and determined that 
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where the triplet prime constant, C3 ~ 2.8582 ... is also 
known to great accuracy. The analytic tools used to 



construct these results are essentially insensitive to the 
flavour of the triplets and so the limits for TO and OT 
must agree [||. 

There exist analogous relations for higher order con- 
stellations of prime numbers. To place the formulae of the 
Hardy-Littlewood Conjectures in perspective, it is useful 
to express the Prime Number Theorem - that there exists 
an infinite number of prime numbers - in the following, 
stronger, form: 
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More recently (with the advent of electronic comput- 
ing), a number of investigators have studied in detail 
the actual distributions of primes and prime constella- 
tions. Particular attention has been paid to enumeration 
of twins and explicit determination of ^(N) for large 
values of N, by Nicely @, and others p, B|. In many of 
these instances, the search for twins is a beneficient ap- 
plication of research in decentralised computing. Other 
analyses have been concerned with the details of the dis- 
tribution of twins [|l(| O, [f| , and the existence and size 
of "gaps" in the sequence of twins jl3| fL6| . In all 
cases, to the best of our knowledge, these attempts sit- 
uate the twins (and higher constellations) within the set 
of natural numbers. 

There are three essential constituents of our new mod- 
els for the distributions of twins and triplets. First and 
foremost is that the distribution of twins and triplets are 
viewed in the context of the sequence of primes, not the 
natural numbers. Second is that for a prime sequence 
of length 7Ti, twins and triplets occur in the manner of 
random fixed-probability events. The third part of each 
model is that the fixed value of the probability depends 
on 7Ti, the length of the sequence, in a fairly simple man- 
ner. 
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METHOD AND RESULTS 

We generated prime numbers in sequence, viz, Pi = 
2, P 2 = 3, P 3 = 5, Pi = 7, P 5 = 11 . . ., and within this se- 
quence counted prime separations defined as the number 
of "singleton" primes occuring between adjacent pairs of 
twins, or triplets of a specified flavour. There is an ir- 
regularity with our definition of separation for the first 
few primes: 2 (3 5) (5 7), where a pair of twins is over- 
lapping, yielding an anomalous prime separation of —1. 
Fortunately, as explained above such overlapping twins 
do not ever recur and we chose to begin our sequences 
with P 3 = 5. 

many twins with prime sep- 
for example, (5 7)(11 13), 
Similarly, there are many 
i.e., (5 7 11)(11 13 17), or 
109), and thus are assigned 
Incidently, all of the prime 
6,P + 8) are 
zero prime sep- 



relative frequency of occurrence of each separation in the 
sequence of primes to N and plotting it vs. the separation 
yields a surprisingly simple linear relation as illustrated 
in Figures and §. 



Note that there are 
aration equal to zero, 
or (137 139)(149 151). 
triplets which overlap, 
(97 101 103)(103 107 
prime separation — 1. 
TT-quadruplets of the form (P, P + 2, P 
comprised of a pair of twins with 
aration. Also they may be viewed as an overlap- 
ping TO-OT flavour combination of triplets. Whereas 
OTO-quadruplets (P, P + 4, P + 6, P + 10) possess a sin- 
gle twin shared between an overlapping OT-TO pair of 
triplets. Clearly, fc-tuplets, fc > 2, or higher-order "prime 
constellations" are composed of smaller, more primitive, 
units. 

Concretely, so as to make our methods perfectly clear, 
consider the set of prime numbers greater than or equal 
to 5 and less than 100. In this range there are seven 
twins, 

{(5 7)(11 13) (17 19)(29 31) (41 43)(59 61)(71 73)} , 

and hence six separations. Two of these happen to be 0, 
three are 1, and one is 2, so the relative frequencies for 
separations s = 0,1,2 are and g, respectively. In 

the set of four TO-triplets between 5 and 100, 

{(5 7 11)(11 13 17)(17 19 23)(41 43 47)} , 

there are three separations. Two are -1, one is 3, so the 
relative frequencies are §, | for s = — 1, 3. The four OT- 
triplets less than 100, 

{(7 11 13) (13 17 19)(37 41 43)(67 71 73)} , 



have three distinct separations. The relative frequencies 
are | for each of s = —1, 3, 4. 

All separations between pairs of twins and TO- OT- 
triplets up to N were computed and tabulated. [We chose 
values of N which ranged from less than 10 5 , the limit 
of poor statistics, to 6 x 10 12 , the limit of our patience.] 
We also obtained iri(N) for each of these ranges and the 
count of the number of singleton primes that occur above 
the last tuple in the range. Taking the logarithm of the 
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FIG. 1: Distribution of twin prime separations for N = lx 10 
and N = 1 X 10 9 . Note the (approximately) linear behaviour, 
and the differing slopes. 

From the figures, we see that the twins and triplets 
are occurring in a fixed-probability random manner, but 
that the probability diminishes as the length of the se- 
quence of primes increases. By way of analogy, consider 
radioactive decay. The likelihood of an atom decaying 
in any short time interval is constant for a particular 
substance, and thus the probability that the next decay 
observed in a sample occurs at time t is proportional to 
exp(— 7t), where 7, the decay rate, is a property of the 
species of atom. The values of the best-fit slopes that 
we measure from figures like those above determine "de- 
cay constants" for the twins and triplet flavours. We will 
also find it useful to think of the mean separation, the 
expected number of singleton primes appearing between 
adjacent fc-tuplets, corresponding to the mean lifetime of 
the radioactive substance. As expected from our analogy, 
s = l/slope. In our reformulation of our twin model ||, 
and our analysis of triplets fL2| , in terms of the counts 
tti(N) and ^2,3 (JV) alone, we cast our analysis in terms 
of 5. 

The linear fits that we employed were constrained to 
ensure that the relative frequencies are properly nor- 
malised. With negligible deleterious effects we treat s 
as a continuous variable and integrate over all possible 
separations: from the minimum value possible to infin- 
ity. For twins, we must have 

+ (intercept) = ln(— (slope)) , f(s) = — ms + \n(m) , (4) 
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FIG. 2: Distribution of triplet prime separations for TV = 
1 x 10 9 and TV = 1 x 10 12 . The distributions for the two 
flavours overlap to such an extent that they are virtually in- 
distinguishable. Note the (approximately) linear behaviour, 
the differing slopes, and the difference in scale from the twins 
case. 



since the minimum separation is 0. For triplets, we have 
a similar condition 



data. 

As the fixed probabilities are seen to change with m, 
we model the manner in which they vary. In the figures 
below, we present the estimated slopes (with statistical 
errors only, the total error is expected to be somewhat 
larger), vs. \og(ni(N)) - with correction for the singleton 
primes beyond the last tuplet - and functions that we 
believe capture very well the behaviour of the "decay 
constant." 
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f(s) = -m(s + 1) + ln(m) , 



(5) 



on account of the fact that the minimum separation is 
-1. 

All of the separations which appeared in the data re- 
ceived equal (frequency-weighted) consideration in our 
computation of best-fit slopes. This has a consequence 
insofar as the large-separation, low-frequency events con- 
stituting the tail of the distribution reduce the magnitude 
of the measured slope, (see Figures [l] and |). One might 
well be inclined to truncate the data by excising the tails 
and fixing the slopes by the (more-strongly-linear) low- 
separation data for each TV. We did not do this because 
it would have entailed a generally systematic discard- 
ing of data from pairs appearing near the upper limit of 
the range, and thus would nearly correspond to the slope 
with greater magnitude that one would expect associated 
with an effective upper limit TV e g- < TV. Viewed from this 
perspective, it is better to weigh all points equally. 

Were it not the case that the magnitude of the slope 
diminished for larger values of TV, then the Hardy- 
Littlewood Conjectures would certainly be false, since 
constant slope would mean that the probability of a given 
prime being a member of a constellation is a universal 
constant. In this case then the numbers of twins and 
triplets 712,3 (TV) would just be fixed fractions of 7Ti(TV) in 
disagreement with Hardy-Littlewood and the empirical 



FIG. 3: Computed slopes vs. log(7ri(TV)), and parameterised 
fits for twins. The longer dashed line is a fit to the empir- 
ical data shown, while the shorter dashed line arises from a 
analysis of prime and twin counts only up to TV" ~ 10 . 

In light of the above comments, we sketch in Figure ||| a 
plot of the slopes for twins versus log(7Ti). The empirical 
trend seen on the graph may be well-described by the 
function (remember that the error bars are understated) 

-m{x) ~ -(1.321 ±0.008)/x , for x = log(7Ti(TV)) . 

(6) 

We note that the factor which appears in the numerator 
is approximately — 2C2, the twin primes constant, as one 
is led to expect by a straightforward argument JlJ. Also, 
the asymptotic behaviour is consistent with the Twin 
Prime Conjecture. On this graph we have also traced 
the curve which results from a discretised model for the 
distribution of twins that we developed || which requires 
only the input of 7Ti(TV) and ^(TV) and was fit to data Q] 
including values of TV up to 3 x 10 15 , well beyond the 
reach of our complete analysis of the actual distribution 
of prime separations. 

For the TO- and OT-triplets similar results ensue. 



-m TO (x) 
-m OT (x) 



-2.87±0.05 
(x+2.49±0.18) 2 

-2.81±0.04 
(x+2.24±0.17) 2 
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FIG. 4: Computed slopes vs. log(7Ti(iV)), and parameterised 
fits for TO- and OT-triplets. 



The factors which appear in the numerators of these ex- 
pressions precisely bracket the triplet prime constant in 
a manner which is consistent with expections |^2[ . The 
error that we quote arises purely from the fit to the data 
and is again most likely an understatement, so we are 
not gQIDg to axgllfi that the TO- and OT-Histrihntions are 



fundamentally dissimilar 



CONCLUSION 

We believe that we have consistently extended our con- 
struction of a novel characterization of the distribution of 
twin primes to the prime triplets. The most essential fea- 
ture of our approach is that we consider the spacings of 
twins and triplets among the primes themselves, rather 
than among the natural numbers. Secondly, we modelled 
the distribution empirically - without preconceptions - 
and found that the twins and triplets appear amongst the 
sequence of primes in a manner characteristic of a com- 
pletely random, fixed probability system. Again working 



empirically, we were able to simply parameterise the vari- 
ation of the "decay constant" in terms of 7Ti , as suggested 
by our outlook. 

Precise details of the triplets case will be reported upon 
in a forthcoming paper Jl2[ . Future work includes exten- 
sion to larger ranges of data, higher-order constellations, 
examination of constellation correlations, and possible 
fractal interpretations. 
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